Credit card fraud has become a very common problem in the modern economy, especially with the rise of online transactions. This paper investigates advanced machine learning techniques for fraud detection, using models such as Random Forest, Gradient Boosting, Support Vector Machines (SVM), and Long Short-Term Memory Networks (LSTM). A hybrid ensemble model is also proposed to enhance the accuracy of detection while reducing false positives and negatives. The experimental results have proven the efficiency of these models, and the hybrid model performed better. [1] [2]
Introduction
Credit cards are widely used for online and offline transactions, offering users short-term credit and convenience. However, credit card fraud has risen significantly due to evolving technologies and vulnerabilities in both physical (POS) and online transactions. Fraud occurs when a card is used without the owner’s permission for financial gain. Common fraud types include POS fraud using hidden skimming devices, card skimming to clone magnetic-stripe data, and phishing schemes that trick cardholders into revealing personal information. Reports such as the NCRB 2020 highlight a major increase in fraud cases, emphasizing the need for stronger detection and prevention systems.
To combat these threats, machine learning techniques are employed to identify suspicious behavior and flag fraudulent activities. Several algorithms support fraud detection:
XGBoost: An efficient gradient-boosting method that builds a sequence of decision trees and uses regularization to prevent overfitting.
LSTM networks: A form of recurrent neural network capable of recognizing temporal patterns in transaction sequences.
Random Forest: An ensemble of decision trees that improves accuracy and reduces variance through random sampling of data and features.
SVM: A supervised learning model that identifies an optimal hyperplane to classify transactions as legitimate or fraudulent.
Hybrid/Ensemble Models: Combine multiple algorithms to enhance robustness and accuracy.
Stacking integrates several base models (e.g., Random Forest, Gradient Boosting, SVM) and uses a meta-model, often Logistic Regression, to improve final predictions.
References
[1] S. B. E. D. Y. Sahin, \"\"A cost-sensitive decision tree approach for fraud detection\",\" Expert Syst. Appl., vol. 40, pp. 5916-5923, 2013.
[2] C. Y. W. L. M. L. Y. Li, \" \"A principle component analysis-based random forest with the potential nearest neighbor method for automobile insurance fraud identification\",\" Appl. Soft Comput...
[3] S. P. S. Subudhi, \"\"Use of optimized Fuzzy C-Means clustering and supervised classifiers for automobile insurance fraud detection\",\" J. King Saud Univ.- Comput. Inf. Sci.
[4] A. A. A. A. O. Adewumi, \"\"A survey of machine learning and nature inspired based credit card fraud detection techniques\",\" Int. J. Syst. Assurance Eng. Manage., vol. 8, no. 2, pp. 937-953, 2017.
[5] A. K. S. S. A. M. A. Srivastava, \"\"Credit card fraud detection using hidden Markov model\",\" IEEE Trans. Depend. Sec. Comput, vol. 5, no. 1, pp. 37-48,, 2008.
[6] M. S. J. T. Quah, \"\"Real-time credit card fraud detection using computational intelligence\",\" Expert Syst. Appl., vol. 35, pp. 1721-1732, 2008.
[7] S. J. K. T. J. C. W. S. Bhattacharyya, \" \"Data mining for credit card fraud: A comparative study\",\" Decision Support Syst., vol. 50, no. 3, pp. 602-613, 2011.
[8] M. K. A. N. S. Halvaiee, \" \"A novel model for credit card fraud detection using artificial immune systems\",\" Appl. Soft Comput.,, vol. 24, pp. 40-49, 2014.
[9] A. K. S. S. A. K. M. S. Panigrahi, \"\"Credit card fraud detection: A fusion approach using Dempster– Shafer theory and Bayesian learning\",\" Inf. Fusion, vol. 10, no. 4, pp. 354-363, 2009.
[10] E. D. N. Mahmoudi, \"\"Detecting credit card fraud by modified fisher discriminant analysis\",\" Expert Syst. Appl., vol. 42, no. 5, pp. 2510-2516, 2015.
[11] M. A. V. L. C. J. M. S. D. Sánchez, \" \"Association rules applied to credit card fraud detection\",\" Expert Syst. Appl.,, vol. 36, no. 2, pp. 3630-3640, 2009.
[12] M. H. O. E. Duman, \" \"Detecting credit card fraud by genetic algorithm and scatter search\",\" Expert Syst. Appl., vol. 38, no. 10, pp. 13057-13063, 2011.
[13] V. R. G. R. R. I. B. P. Ravisankar, \"\"Detection of financial statement fraud and feature selection using data mining techniques\",\" Decision Support Syst , vol. 50, no. 2, pp. 491-500, 2011.
[14] C. S. Y. M. E. Kirkos, \" \"Data mining techniques for the detection of fraudulent financial statements\",\" Expert Syst. Appl., vol. 32, no. 4, pp. 995-1003, 2007.
[15] S. B. Y. F. H. Glancy, \" \"A computational model for financial reporting fraud detection\",\" Decision Support Syst., vol. 50, no. 3, pp. 595-601, 2011.
[16] M. B. D. E. A. S. T. C. I. T. Christou, \"\"Detecting fraud in online games of chance and lotteries\",\" Expert Syst. Appl., vol. 38, no. 10, pp. 13158-13169, 2011.
[17] C.-F. Tsai, \"\"Combining cluster analysis with classifier ensembles to predict financial distress\",\" Inf. Fusion, vol. 16, pp. 46-58, 2014.
[18] D. J. C. J. Y. Z. F. H. Chen, \"\"Application of random forest rough set theory decision tree and neural network to detect financial statement fraud—Taking corporate governance into consideration\",\" Proc. Int. Conf. Intell. Comput, pp. 221- 234, 2014.
[19] C. P. L. K. S. T. W. S. L. M. Seera, \"\"Classification of transcranial Doppler signals using individual and ensemble recurrent neural networks\",\" Neurocomputing, vol. 249, pp. 337-344, 2017.
[20] K. S.-M. V. L. R. G. C. Phua, \"Resilient identity crime detection\",\" IEEE Trans. Knowl. Data Eng., vol. 24, no. 3, pp. 533-546, 2012.
[21] M. W. Powers, \"\"Evaluation: From precision recall and F measure to ROC informedness markednes and correlation\",\" J. Mach. Learn. Technol., vol. 2, no. 1, pp. 37-63, 2011.
[22] C. K. L. M. S. C. P. L. A. K. N. K. Randhawa, \"Credit Card Fraud Detection Using AdaBoost and Majority Voting,\" IEEE Access, vol. 6, pp. 14277- 14284, 2018.
[23] B. P. S. R. S. Lubna Shaikh, \"\"Credit Card Fraud Detection Using Machine Learning Algorithms\",\" 2023